# Text generation optimization
## Deepseek R1 Distill Qwen 14B GRPO Taiwan Spirit
A fine-tuned version of the Qwen-14B model trained with the GRPO method, suited to text generation tasks.
- Tags: Large Language Model, Transformers
- Author: kartd · Downloads: 111 · Likes: 1

## Arshstory
A 500-million-parameter text generation model based on the Llama architecture, specifically designed for story creation.
- License: MIT
- Tags: Text Generation, Transformers
- Author: arshiaafshani · Downloads: 131 · Likes: 1

## Latitudegames.muse 12B GGUF
Muse-12B is a 12B-parameter text generation model developed by LatitudeGames, aimed at high-quality text generation.
- Tags: Large Language Model
- Author: DevQuasar · Downloads: 291 · Likes: 1

## Magtie V1 12B
MagTie-v1-12B is a 12B-parameter language model merged with the DARE TIES algorithm, combining the strengths of multiple pre-trained models (a merge-recipe sketch follows this entry).
- License: Apache-2.0
- Tags: Large Language Model, Transformers
- Author: grimjim · Downloads: 32 · Likes: 2

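DARE TIES merges like this one (and the Gemma and Yi merges further down the list) are typically produced with the mergekit toolkit. The sketch below is a minimal, hypothetical recipe, not the published MagTie-v1-12B configuration: the donor model names, densities, and weights are placeholder assumptions; only the general `mergekit-yaml` workflow reflects mergekit's documented usage.

```python
# Minimal sketch of a DARE TIES merge driven from Python via mergekit's CLI.
# Model names, densities, and weights are illustrative placeholders,
# NOT the actual MagTie-v1-12B recipe.
import subprocess
from pathlib import Path

config = """\
merge_method: dare_ties
base_model: example-org/base-12b          # hypothetical base model
models:
  - model: example-org/donor-a-12b        # hypothetical donor checkpoint
    parameters:
      density: 0.5    # fraction of delta weights kept before rescaling (DARE)
      weight: 0.6     # this donor's contribution in the TIES sign-consensus step
  - model: example-org/donor-b-12b
    parameters:
      density: 0.5
      weight: 0.4
dtype: bfloat16
"""

Path("dare_ties.yaml").write_text(config)

# mergekit reads the YAML recipe and writes the merged model to the output directory.
subprocess.run(["mergekit-yaml", "dare_ties.yaml", "./merged-12b"], check=True)
```

Swapping `merge_method` (and the parameters each method expects) covers the SLERP and Model Stock merges that appear later in this list.
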
## Mistral Small 24B Instruct 2501 GGUF
Mistral-Small-24B-Instruct-2501 is a 24B-parameter instruction-finetuned large language model supporting multilingual text generation tasks.
- License: Apache-2.0
- Tags: Large Language Model, Supports Multiple Languages
- Author: bartowski · Downloads: 48.61k · Likes: 111

## Salesforce.llama Xlam 2 70b Fc R GGUF
Llama-xLAM-2-70b-fc-r is a 70-billion-parameter large language model released by Salesforce, built on the Llama architecture.
- Tags: Large Language Model
- Author: DevQuasar · Downloads: 238 · Likes: 1

## MT Gen10 Gemma 2 9B
A multi-model merge of Gemma-2-9B-series models combined with the DARE TIES method, integrating the strengths of several Gemma variants.
- Tags: Large Language Model, Transformers
- Author: zelk12 · Downloads: 26 · Likes: 2

## Mtmme Merge Gemma 2 9B
A text generation model produced by merging Gemma-2-9B variants with the SLERP method.
- Tags: Large Language Model, Transformers
- Author: zelk12 · Downloads: 19 · Likes: 2

## Irix 12B Model Stock
A merge of multiple 12B-parameter language models produced with the mergekit tool using the Model Stock method.
- Tags: Large Language Model, Transformers
- Author: DreadPoor · Downloads: 373 · Likes: 9

## L3.3 Cu Mai R1 70b
A specially optimized 70B-parameter large language model based on the Llama 3 architecture.
- Tags: Large Language Model
- Author: Steelskull · Downloads: 164 · Likes: 14

## Llama 3.2 1B Instruct GGUF
A GGUF-format conversion of Llama-3.2-1B-Instruct for broader runtime support and efficient local inference (a loading sketch follows this entry).
- Tags: Large Language Model
- Author: MaziyarPanahi · Downloads: 190.76k · Likes: 12

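GGUF repositories like this one are usually consumed through llama.cpp or its Python bindings. Below is a minimal sketch assuming the `llama-cpp-python` and `huggingface_hub` packages; the quantization filename is an assumption and should be checked against the repository's file list.

```python
# Minimal sketch: download one GGUF quant and run it with llama-cpp-python.
# The filename below is an assumption -- check the repository's file listing
# for the exact quantization you want (Q4_K_M, Q8_0, etc.).
from huggingface_hub import hf_hub_download
from llama_cpp import Llama

model_path = hf_hub_download(
    repo_id="MaziyarPanahi/Llama-3.2-1B-Instruct-GGUF",
    filename="Llama-3.2-1B-Instruct.Q4_K_M.gguf",  # assumed filename
)

llm = Llama(model_path=model_path, n_ctx=4096)

out = llm.create_chat_completion(
    messages=[{"role": "user",
               "content": "Write a two-sentence product description for a solar lantern."}],
    max_tokens=128,
)
print(out["choices"][0]["message"]["content"])
```

The same pattern applies to the other GGUF entries in this list; only the repository id, filename, and context size change.
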
## Mistral NeMo Minitron 8B Base IMat GGUF
llama.cpp imatrix quantizations of the nvidia/Mistral-NeMo-Minitron-8B-Base model, providing additional options for model usage and deployment.
- License: Other
- Tags: Large Language Model
- Author: legraphista · Downloads: 1,115 · Likes: 1

## Wizardlm 2 7B Abliterated GGUF
llama.cpp imatrix quantizations of WizardLM-2-7B-abliterated, offering multiple quantization levels for different hardware configurations.
- License: Apache-2.0
- Tags: Large Language Model
- Author: bartowski · Downloads: 2,561 · Likes: 13

## Lola V1
LOLA is a massively multilingual large language model built on a sparse Mixture-of-Experts (MoE) Transformer architecture; it supports more than 160 languages and is competitive on natural language generation and understanding tasks.
- Tags: Large Language Model, Transformers, Other
- Author: dice-research · Downloads: 867 · Likes: 10

## Fusellm 7B
FuseLLM-7B is a unified model that integrates knowledge from multiple open-source large language models, combining the capabilities of LLMs with different architectures through knowledge fusion.
- License: Apache-2.0
- Tags: Large Language Model, Transformers, Supports Multiple Languages
- Author: Wanfq · Downloads: 45 · Likes: 22

## Caplattessdolxaboros Yi 34B 200K DARE Ties HighDensity
A high-density merge built on the Yi-34B-200K base model, integrating several models of the same lineage via the DARE TIES method while retaining 200K long-context capability.
- License: Other
- Tags: Large Language Model, Transformers, English
- Author: brucethemoose · Downloads: 94 · Likes: 12

## Tiny Vicuna 1B GGUF
Tiny-Vicuna-1B is a lightweight model fine-tuned from TinyLlama 1.1B on the WizardVicuna dataset, intended for early-stage experimental iteration.
- Tags: Large Language Model
- Author: afrideva · Downloads: 208.74k · Likes: 6

## Spelling Correction Multilingual Base
An experimental model for correcting spelling and punctuation errors in English and German text.
- License: MIT
- Tags: Text Generation, Transformers, Supports Multiple Languages
- Author: oliverguhr · Downloads: 655 · Likes: 11

## Flan T5 Xxl Sharded Fp16
FLAN-T5 XXL is a variant of Google's T5, fine-tuned on more than 1,000 additional tasks; it supports multiple languages and outperforms the original T5. This repository provides a sharded fp16 checkpoint (a loading sketch follows this entry).
- License: Apache-2.0
- Tags: Large Language Model, Transformers
- Author: philschmid · Downloads: 531 · Likes: 54

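The point of a sharded fp16 checkpoint is that it can be loaded shard by shard onto limited hardware. The sketch below assumes the `transformers` and `accelerate` packages; the repository id is assembled from the listing above and should be verified, and the prompt and generation settings are arbitrary.

```python
# Minimal sketch: load a sharded fp16 FLAN-T5 XXL checkpoint and run one prompt.
# Repo id is assembled from the listing above; verify it before use.
# device_map="auto" (via accelerate) places shards across available GPUs/CPU.
import torch
from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

repo_id = "philschmid/flan-t5-xxl-sharded-fp16"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSeq2SeqLM.from_pretrained(
    repo_id,
    torch_dtype=torch.float16,  # keep the checkpoint in fp16
    device_map="auto",          # stream shards onto available devices
)

inputs = tokenizer("Translate to German: The weather is nice today.",
                   return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=40)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```
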
## Reward Model Deberta V3 Large
A reward model trained to predict which generated answer human evaluators would prefer for a given question (a scoring sketch follows this entry).
- License: MIT
- Tags: Large Language Model, Transformers, English
- Author: OpenAssistant · Downloads: 796 · Likes: 23

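Reward models of this kind are sequence classifiers that output a single scalar preference score for a (question, answer) pair. The sketch below follows the usual transformers sequence-classification pattern; the repository id is assembled from the listing above and should be verified, and the example texts are arbitrary.

```python
# Minimal sketch: score two candidate answers with a preference/reward model.
# A higher logit means the reward model predicts humans would prefer that answer.
# Repo id is assembled from the listing above; verify it before use.
import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

repo_id = "OpenAssistant/reward-model-deberta-v3-large"  # assumed repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForSequenceClassification.from_pretrained(repo_id)
model.eval()

question = "Explain nuclear fusion like I am five."
answers = [
    "Fusion is when tiny particles squeeze together and make lots of energy, like the sun does.",
    "I don't know.",
]

with torch.no_grad():
    for answer in answers:
        # Question and answer are encoded as a sentence pair.
        inputs = tokenizer(question, answer, return_tensors="pt", truncation=True)
        score = model(**inputs).logits[0].item()
        print(f"{score:+.3f}  {answer}")
```
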
## Gpt2 Medium Dutch Embeddings
A Dutch language model based on GPT-2 medium, in which only the lexical embedding layer was retrained to adapt the vocabulary to Dutch.
- Tags: Large Language Model, Other
- Author: GroNLP · Downloads: 27 · Likes: 3